Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 1586614 |
| Missing cells | 68148 |
| Missing cells (%) | 0.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 157.4 MiB |
| Average record size in memory | 104.0 B |
Variable types
| NUM | 9 |
|---|---|
| CAT | 4 |
brewery_name has a high cardinality: 5742 distinct values | High cardinality |
review_profilename has a high cardinality: 33387 distinct values | High cardinality |
beer_style has a high cardinality: 104 distinct values | High cardinality |
beer_name has a high cardinality: 56857 distinct values | High cardinality |
beer_abv has 67785 (4.3%) missing values | Missing |
Reproduction
| Analysis started | 2020-10-30 05:38:17.627596 |
|---|---|
| Analysis finished | 2020-10-30 05:39:36.972371 |
| Duration | 1 minute and 19.34 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
brewery_id
Real number (ℝ≥0)
| Distinct | 5840 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3130.099202 |
|---|---|
| Minimum | 1 |
| Maximum | 28003 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 143 |
| median | 429 |
| Q3 | 2372 |
| 95-th percentile | 16866 |
| Maximum | 28003 |
| Range | 28002 |
| Interquartile range (IQR) | 2229 |
Descriptive statistics
| Standard deviation | 5578.103987 |
|---|---|
| Coefficient of variation (CV) | 1.782085368 |
| Kurtosis | 3.408354127 |
| Mean | 3130.099202 |
| Median Absolute Deviation (MAD) | 366 |
| Skewness | 2.083747568 |
| Sum | 4966259215 |
| Variance | 31115244.1 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 35 | 39444 | 2.5% | |
| 10099 | 33839 | 2.1% | |
| 147 | 33066 | 2.1% | |
| 140 | 28751 | 1.8% | |
| 287 | 25191 | 1.6% | |
| 132 | 24083 | 1.5% | |
| 1199 | 20004 | 1.3% | |
| 345 | 19479 | 1.2% | |
| 220 | 16837 | 1.1% | |
| 30 | 16107 | 1.0% | |
| Other values (5830) | 1329813 | 83.8% |
| Value | Count | Frequency (%) | |
| 1 | 1357 | 0.1% | |
| 2 | 40 | < 0.1% | |
| 3 | 5357 | 0.3% | |
| 4 | 7321 | 0.5% | |
| 5 | 728 | < 0.1% |
| Value | Count | Frequency (%) | |
| 28003 | 2 | < 0.1% | |
| 28000 | 1 | < 0.1% | |
| 27984 | 1 | < 0.1% | |
| 27980 | 3 | < 0.1% | |
| 27945 | 1 | < 0.1% |
| Distinct | 5742 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 15 |
| Missing (%) | < 0.1% |
| Memory size | 12.1 MiB |
| Boston Beer Company (Samuel Adams) | 39444 |
|---|---|
| Dogfish Head Brewery | 33839 |
| Stone Brewing Co. | 33066 |
| Sierra Nevada Brewing Co. | 28751 |
| Bell's Brewery, Inc. | 25191 |
| Other values (5737) |
| Value | Count | Frequency (%) | |
| Boston Beer Company (Samuel Adams) | 39444 | 2.5% | |
| Dogfish Head Brewery | 33839 | 2.1% | |
| Stone Brewing Co. | 33066 | 2.1% | |
| Sierra Nevada Brewing Co. | 28751 | 1.8% | |
| Bell's Brewery, Inc. | 25191 | 1.6% | |
| Rogue Ales | 24083 | 1.5% | |
| Founders Brewing Company | 20004 | 1.3% | |
| Victory Brewing Company | 19479 | 1.2% | |
| Lagunitas Brewing Company | 16837 | 1.1% | |
| Avery Brewing Company | 16107 | 1.0% | |
| Other values (5732) | 1329798 | 83.8% |
Frequencies of value counts
Unique
| Unique | 672 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 66 |
|---|---|
| Median length | 23 |
| Mean length | 23.61012761 |
| Min length | 3 |
review_time
Real number (ℝ≥0)
| Distinct | 1577960 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1224089280 |
|---|---|
| Minimum | 840672001 |
| Maximum | 1326285348 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 840672001 |
|---|---|
| 5-th percentile | 1071431292 |
| Q1 | 1173224188 |
| median | 1239202882 |
| Q3 | 1288568405 |
| 95-th percentile | 1318389924 |
| Maximum | 1326285348 |
| Range | 485613347 |
| Interquartile range (IQR) | 115344217 |
Descriptive statistics
| Standard deviation | 76544274.54 |
|---|---|
| Coefficient of variation (CV) | 0.06253161088 |
| Kurtosis | -0.3136982976 |
| Mean | 1224089280 |
| Median Absolute Deviation (MAD) | 54219357.5 |
| Skewness | -0.7352727768 |
| Sum | 1.942157189e+15 |
| Variance | 5.859025965e+15 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1101772800 | 21 | < 0.1% | |
| 1031101200 | 8 | < 0.1% | |
| 926380801 | 8 | < 0.1% | |
| 897091201 | 7 | < 0.1% | |
| 1022112001 | 7 | < 0.1% | |
| 980812801 | 7 | < 0.1% | |
| 904867201 | 6 | < 0.1% | |
| 933033601 | 6 | < 0.1% | |
| 902966401 | 6 | < 0.1% | |
| 926294401 | 5 | < 0.1% | |
| Other values (1577950) | 1586533 | > 99.9% |
| Value | Count | Frequency (%) | |
| 840672001 | 1 | < 0.1% | |
| 884390401 | 1 | < 0.1% | |
| 884649601 | 1 | < 0.1% | |
| 885340801 | 1 | < 0.1% | |
| 885427201 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1326285348 | 1 | < 0.1% | |
| 1326284970 | 1 | < 0.1% | |
| 1326276656 | 1 | < 0.1% | |
| 1326275049 | 1 | < 0.1% | |
| 1326274454 | 1 | < 0.1% |
review_overall
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.815580853 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 7 |
| Zeros (%) | < 0.1% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7206218681 |
|---|---|
| Coefficient of variation (CV) | 0.1888629532 |
| Kurtosis | 1.631038958 |
| Mean | 3.815580853 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -1.023968713 |
| Sum | 6053854 |
| Variance | 0.5192958767 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 4 | 582764 | 36.7% | |
| 4.5 | 324385 | 20.4% | |
| 3.5 | 301817 | 19.0% | |
| 3 | 165644 | 10.4% | |
| 5 | 91320 | 5.8% | |
| 2.5 | 58523 | 3.7% | |
| 2 | 38225 | 2.4% | |
| 1.5 | 12975 | 0.8% | |
| 1 | 10954 | 0.7% | |
| 0 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 7 | < 0.1% | |
| 1 | 10954 | 0.7% | |
| 1.5 | 12975 | 0.8% | |
| 2 | 38225 | 2.4% | |
| 2.5 | 58523 | 3.7% |
| Value | Count | Frequency (%) | |
| 5 | 91320 | 5.8% | |
| 4.5 | 324385 | 20.4% | |
| 4 | 582764 | 36.7% | |
| 3.5 | 301817 | 19.0% | |
| 3 | 165644 | 10.4% |
review_aroma
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.735636078 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4.5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.6976167288 |
|---|---|
| Coefficient of variation (CV) | 0.1867464374 |
| Kurtosis | 1.145196752 |
| Mean | 3.735636078 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.838530526 |
| Sum | 5927012.5 |
| Variance | 0.4866691003 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=9)
| Value | Count | Frequency (%) | |
| 4 | 557383 | 35.1% | |
| 3.5 | 365312 | 23.0% | |
| 4.5 | 271450 | 17.1% | |
| 3 | 200030 | 12.6% | |
| 2.5 | 66359 | 4.2% | |
| 5 | 64117 | 4.0% | |
| 2 | 42566 | 2.7% | |
| 1.5 | 12524 | 0.8% | |
| 1 | 6873 | 0.4% |
| Value | Count | Frequency (%) | |
| 1 | 6873 | 0.4% | |
| 1.5 | 12524 | 0.8% | |
| 2 | 42566 | 2.7% | |
| 2.5 | 66359 | 4.2% | |
| 3 | 200030 | 12.6% |
| Value | Count | Frequency (%) | |
| 5 | 64117 | 4.0% | |
| 4.5 | 271450 | 17.1% | |
| 4 | 557383 | 35.1% | |
| 3.5 | 365312 | 23.0% | |
| 3 | 200030 | 12.6% |
review_appearance
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.841641697 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 7 |
| Zeros (%) | < 0.1% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4.5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.6160927689 |
|---|---|
| Coefficient of variation (CV) | 0.160372262 |
| Kurtosis | 1.738866541 |
| Mean | 3.841641697 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.9024199172 |
| Sum | 6095202.5 |
| Variance | 0.3795702999 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 4 | 674186 | 42.5% | |
| 3.5 | 318529 | 20.1% | |
| 4.5 | 288108 | 18.2% | |
| 3 | 166009 | 10.5% | |
| 5 | 65398 | 4.1% | |
| 2.5 | 39493 | 2.5% | |
| 2 | 25414 | 1.6% | |
| 1.5 | 6147 | 0.4% | |
| 1 | 3323 | 0.2% | |
| 0 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 7 | < 0.1% | |
| 1 | 3323 | 0.2% | |
| 1.5 | 6147 | 0.4% | |
| 2 | 25414 | 1.6% | |
| 2.5 | 39493 | 2.5% |
| Value | Count | Frequency (%) | |
| 5 | 65398 | 4.1% | |
| 4.5 | 288108 | 18.2% | |
| 4 | 674186 | 42.5% | |
| 3.5 | 318529 | 20.1% | |
| 3 | 166009 | 10.5% |
| Distinct | 33387 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 348 |
| Missing (%) | < 0.1% |
| Memory size | 12.1 MiB |
| northyorksammy | 5817 |
|---|---|
| BuckeyeNation | 4661 |
| mikesgroove | 4617 |
| Thorpe429 | 3518 |
| womencantsail | 3497 |
| Other values (33382) |
| Value | Count | Frequency (%) | |
| northyorksammy | 5817 | 0.4% | |
| BuckeyeNation | 4661 | 0.3% | |
| mikesgroove | 4617 | 0.3% | |
| Thorpe429 | 3518 | 0.2% | |
| womencantsail | 3497 | 0.2% | |
| NeroFiddled | 3488 | 0.2% | |
| ChainGangGuy | 3471 | 0.2% | |
| brentk56 | 3357 | 0.2% | |
| Phyl21ca | 3179 | 0.2% | |
| WesWes | 3168 | 0.2% | |
| Other values (33377) | 1547493 | 97.5% |
Frequencies of value counts
Unique
| Unique | 10443 ? |
|---|---|
| Unique (%) | 0.7% |
Histogram of lengths of the category
Length
| Max length | 16 |
|---|---|
| Median length | 9 |
| Mean length | 8.961438636 |
| Min length | 3 |
| Distinct | 104 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 MiB |
| American IPA | 117586 |
|---|---|
| American Double / Imperial IPA | 85977 |
| American Pale Ale (APA) | 63469 |
| Russian Imperial Stout | 54129 |
| American Double / Imperial Stout | 50705 |
| Other values (99) |
| Value | Count | Frequency (%) | |
| American IPA | 117586 | 7.4% | |
| American Double / Imperial IPA | 85977 | 5.4% | |
| American Pale Ale (APA) | 63469 | 4.0% | |
| Russian Imperial Stout | 54129 | 3.4% | |
| American Double / Imperial Stout | 50705 | 3.2% | |
| American Porter | 50477 | 3.2% | |
| American Amber / Red Ale | 45751 | 2.9% | |
| Belgian Strong Dark Ale | 37743 | 2.4% | |
| Fruit / Vegetable Beer | 33861 | 2.1% | |
| American Strong Ale | 31945 | 2.0% | |
| Other values (94) | 1014971 | 64.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 35 |
|---|---|
| Median length | 18 |
| Mean length | 17.86997972 |
| Min length | 4 |
review_palate
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.743701367 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4.5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.6822183634 |
|---|---|
| Coefficient of variation (CV) | 0.1822309785 |
| Kurtosis | 1.303397287 |
| Mean | 3.743701367 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.8691499712 |
| Sum | 5939809 |
| Variance | 0.4654218953 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=9)
| Value | Count | Frequency (%) | |
| 4 | 606711 | 38.2% | |
| 3.5 | 338585 | 21.3% | |
| 4.5 | 253102 | 16.0% | |
| 3 | 206932 | 13.0% | |
| 2.5 | 62842 | 4.0% | |
| 5 | 62190 | 3.9% | |
| 2 | 38333 | 2.4% | |
| 1.5 | 11045 | 0.7% | |
| 1 | 6874 | 0.4% |
| Value | Count | Frequency (%) | |
| 1 | 6874 | 0.4% | |
| 1.5 | 11045 | 0.7% | |
| 2 | 38333 | 2.4% | |
| 2.5 | 62842 | 4.0% | |
| 3 | 206932 | 13.0% |
| Value | Count | Frequency (%) | |
| 5 | 62190 | 3.9% | |
| 4.5 | 253102 | 16.0% | |
| 4 | 606711 | 38.2% | |
| 3.5 | 338585 | 21.3% | |
| 3 | 206932 | 13.0% |
review_taste
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.792860456 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7319696099 |
|---|---|
| Coefficient of variation (CV) | 0.1929861692 |
| Kurtosis | 1.341669306 |
| Mean | 3.792860456 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.9734324438 |
| Sum | 6017805.5 |
| Variance | 0.5357795098 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=9)
| Value | Count | Frequency (%) | |
| 4 | 541429 | 34.1% | |
| 4.5 | 336162 | 21.2% | |
| 3.5 | 324541 | 20.5% | |
| 3 | 166860 | 10.5% | |
| 5 | 83977 | 5.3% | |
| 2.5 | 66534 | 4.2% | |
| 2 | 41992 | 2.6% | |
| 1.5 | 15128 | 1.0% | |
| 1 | 9991 | 0.6% |
| Value | Count | Frequency (%) | |
| 1 | 9991 | 0.6% | |
| 1.5 | 15128 | 1.0% | |
| 2 | 41992 | 2.6% | |
| 2.5 | 66534 | 4.2% | |
| 3 | 166860 | 10.5% |
| Value | Count | Frequency (%) | |
| 5 | 83977 | 5.3% | |
| 4.5 | 336162 | 21.2% | |
| 4 | 541429 | 34.1% | |
| 3.5 | 324541 | 20.5% | |
| 3 | 166860 | 10.5% |
| Distinct | 56857 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 MiB |
| 90 Minute IPA | 3290 |
|---|---|
| India Pale Ale | 3130 |
| Old Rasputin Russian Imperial Stout | 3111 |
| Sierra Nevada Celebration Ale | 3000 |
| Two Hearted Ale | 2728 |
| Other values (56852) |
| Value | Count | Frequency (%) | |
| 90 Minute IPA | 3290 | 0.2% | |
| India Pale Ale | 3130 | 0.2% | |
| Old Rasputin Russian Imperial Stout | 3111 | 0.2% | |
| Sierra Nevada Celebration Ale | 3000 | 0.2% | |
| Two Hearted Ale | 2728 | 0.2% | |
| Arrogant Bastard Ale | 2704 | 0.2% | |
| Stone Ruination IPA | 2704 | 0.2% | |
| Sierra Nevada Pale Ale | 2587 | 0.2% | |
| Stone IPA (India Pale Ale) | 2575 | 0.2% | |
| Pliny The Elder | 2527 | 0.2% | |
| Other values (56847) | 1558258 | 98.2% |
Frequencies of value counts
Unique
| Unique | 18908 ? |
|---|---|
| Unique (%) | 1.2% |
Histogram of lengths of the category
Length
| Max length | 75 |
|---|---|
| Median length | 19 |
| Mean length | 20.45317513 |
| Min length | 1 |
| Distinct | 530 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 67785 |
| Missing (%) | 4.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.042386753 |
|---|---|
| Minimum | 0.01 |
| Maximum | 57.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 4.5 |
| Q1 | 5.2 |
| median | 6.5 |
| Q3 | 8.5 |
| 95-th percentile | 11 |
| Maximum | 57.7 |
| Range | 57.69 |
| Interquartile range (IQR) | 3.3 |
Descriptive statistics
| Standard deviation | 2.322525993 |
|---|---|
| Coefficient of variation (CV) | 0.3297924516 |
| Kurtosis | 6.961811545 |
| Mean | 7.042386753 |
| Median Absolute Deviation (MAD) | 1.5 |
| Skewness | 1.543406148 |
| Sum | 10696181.23 |
| Variance | 5.394126987 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5 | 109144 | 6.9% | |
| 8 | 67744 | 4.3% | |
| 6 | 65383 | 4.1% | |
| 7 | 59460 | 3.7% | |
| 9 | 59183 | 3.7% | |
| 5.5 | 59010 | 3.7% | |
| 10 | 54780 | 3.5% | |
| 6.5 | 48369 | 3.0% | |
| 5.2 | 43268 | 2.7% | |
| 7.5 | 39978 | 2.5% | |
| Other values (520) | 912510 | 57.5% | |
| (Missing) | 67785 | 4.3% |
| Value | Count | Frequency (%) | |
| 0.01 | 5 | < 0.1% | |
| 0.05 | 17 | < 0.1% | |
| 0.08 | 1 | < 0.1% | |
| 0.1 | 11 | < 0.1% | |
| 0.25 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 57.7 | 1 | < 0.1% | |
| 43 | 2 | < 0.1% | |
| 41 | 76 | < 0.1% | |
| 39.44 | 3 | < 0.1% | |
| 39 | 7 | < 0.1% |
beer_beerid
Real number (ℝ≥0)
| Distinct | 66055 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21712.79428 |
|---|---|
| Minimum | 3 |
| Maximum | 77317 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 213 |
| Q1 | 1717 |
| median | 13906 |
| Q3 | 39441 |
| 95-th percentile | 62653 |
| Maximum | 77317 |
| Range | 77314 |
| Interquartile range (IQR) | 37724 |
Descriptive statistics
| Standard deviation | 21818.336 |
|---|---|
| Coefficient of variation (CV) | 1.004860808 |
| Kurtosis | -0.8339342225 |
| Mean | 21712.79428 |
| Median Absolute Deviation (MAD) | 13217 |
| Skewness | 0.6893969312 |
| Sum | 3.444982338e+10 |
| Variance | 476039785.7 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2093 | 3290 | 0.2% | |
| 412 | 3111 | 0.2% | |
| 1904 | 3000 | 0.2% | |
| 1093 | 2728 | 0.2% | |
| 92 | 2704 | 0.2% | |
| 4083 | 2704 | 0.2% | |
| 276 | 2587 | 0.2% | |
| 88 | 2575 | 0.2% | |
| 7971 | 2527 | 0.2% | |
| 11757 | 2502 | 0.2% | |
| Other values (66045) | 1558886 | 98.3% |
| Value | Count | Frequency (%) | |
| 3 | 3 | < 0.1% | |
| 4 | 10 | < 0.1% | |
| 5 | 424 | < 0.1% | |
| 6 | 877 | 0.1% | |
| 7 | 659 | < 0.1% |
| Value | Count | Frequency (%) | |
| 77317 | 1 | < 0.1% | |
| 77316 | 1 | < 0.1% | |
| 77315 | 1 | < 0.1% | |
| 77314 | 1 | < 0.1% | |
| 77313 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| brewery_id | brewery_name | review_time | review_overall | review_aroma | review_appearance | review_profilename | beer_style | review_palate | review_taste | beer_name | beer_abv | beer_beerid | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10325 | Vecchio Birraio | 1234817823 | 1.5 | 2.0 | 2.5 | stcules | Hefeweizen | 1.5 | 1.5 | Sausa Weizen | 5.0 | 47986 |
| 1 | 10325 | Vecchio Birraio | 1235915097 | 3.0 | 2.5 | 3.0 | stcules | English Strong Ale | 3.0 | 3.0 | Red Moon | 6.2 | 48213 |
| 2 | 10325 | Vecchio Birraio | 1235916604 | 3.0 | 2.5 | 3.0 | stcules | Foreign / Export Stout | 3.0 | 3.0 | Black Horse Black Beer | 6.5 | 48215 |
| 3 | 10325 | Vecchio Birraio | 1234725145 | 3.0 | 3.0 | 3.5 | stcules | German Pilsener | 2.5 | 3.0 | Sausa Pils | 5.0 | 47969 |
| 4 | 1075 | Caldera Brewing Company | 1293735206 | 4.0 | 4.5 | 4.0 | johnmichaelsen | American Double / Imperial IPA | 4.0 | 4.5 | Cauldron DIPA | 7.7 | 64883 |
| 5 | 1075 | Caldera Brewing Company | 1325524659 | 3.0 | 3.5 | 3.5 | oline73 | Herbed / Spiced Beer | 3.0 | 3.5 | Caldera Ginger Beer | 4.7 | 52159 |
| 6 | 1075 | Caldera Brewing Company | 1318991115 | 3.5 | 3.5 | 3.5 | Reidrover | Herbed / Spiced Beer | 4.0 | 4.0 | Caldera Ginger Beer | 4.7 | 52159 |
| 7 | 1075 | Caldera Brewing Company | 1306276018 | 3.0 | 2.5 | 3.5 | alpinebryant | Herbed / Spiced Beer | 2.0 | 3.5 | Caldera Ginger Beer | 4.7 | 52159 |
| 8 | 1075 | Caldera Brewing Company | 1290454503 | 4.0 | 3.0 | 3.5 | LordAdmNelson | Herbed / Spiced Beer | 3.5 | 4.0 | Caldera Ginger Beer | 4.7 | 52159 |
| 9 | 1075 | Caldera Brewing Company | 1285632924 | 4.5 | 3.5 | 5.0 | augustgarage | Herbed / Spiced Beer | 4.0 | 4.0 | Caldera Ginger Beer | 4.7 | 52159 |
Last rows
| brewery_id | brewery_name | review_time | review_overall | review_aroma | review_appearance | review_profilename | beer_style | review_palate | review_taste | beer_name | beer_abv | beer_beerid | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1586604 | 14359 | The Defiant Brewing Company | 1288890206 | 4.0 | 4.5 | 4.5 | njmoons | Pumpkin Ale | 3.5 | 3.5 | The Horseman's Ale | 5.2 | 33061 |
| 1586605 | 14359 | The Defiant Brewing Company | 1163291143 | 5.0 | 5.0 | 5.0 | NyackNicky | Pumpkin Ale | 5.0 | 5.0 | The Horseman's Ale | 5.2 | 33061 |
| 1586606 | 14359 | The Defiant Brewing Company | 1162871808 | 5.0 | 4.5 | 4.0 | blitheringidiot | Pumpkin Ale | 5.0 | 5.0 | The Horseman's Ale | 5.2 | 33061 |
| 1586607 | 14359 | The Defiant Brewing Company | 1162865640 | 5.0 | 5.0 | 4.5 | PopeDX | Pumpkin Ale | 5.0 | 4.5 | The Horseman's Ale | 5.2 | 33061 |
| 1586608 | 14359 | The Defiant Brewing Company | 1162685856 | 3.5 | 4.0 | 4.0 | treehugger02010 | Pumpkin Ale | 3.5 | 3.0 | The Horseman's Ale | 5.2 | 33061 |
| 1586609 | 14359 | The Defiant Brewing Company | 1162684892 | 5.0 | 4.0 | 3.5 | maddogruss | Pumpkin Ale | 4.0 | 4.0 | The Horseman's Ale | 5.2 | 33061 |
| 1586610 | 14359 | The Defiant Brewing Company | 1161048566 | 4.0 | 5.0 | 2.5 | yelterdow | Pumpkin Ale | 2.0 | 4.0 | The Horseman's Ale | 5.2 | 33061 |
| 1586611 | 14359 | The Defiant Brewing Company | 1160702513 | 4.5 | 3.5 | 3.0 | TongoRad | Pumpkin Ale | 3.5 | 4.0 | The Horseman's Ale | 5.2 | 33061 |
| 1586612 | 14359 | The Defiant Brewing Company | 1160023044 | 4.0 | 4.5 | 4.5 | dherling | Pumpkin Ale | 4.5 | 4.5 | The Horseman's Ale | 5.2 | 33061 |
| 1586613 | 14359 | The Defiant Brewing Company | 1160005319 | 5.0 | 4.5 | 4.5 | cbl2 | Pumpkin Ale | 4.5 | 4.5 | The Horseman's Ale | 5.2 | 33061 |